The Ins and Outs of Dutch noun countability classification

نویسندگان

  • Timothy Baldwin
  • Leonoor van der Beek
چکیده

This paper presents a range of methods for classifying Dutch noun countability based on either Dutch or English data. The classification is founded on translational equivalences and the corpus analysis of linguistic features which correlate with particular countability classes. We show that crosslingual classification on the basis of word-to-word or featureto-feature mappings between English and Dutch performs at least as well as in-language classification based on gold-standard Dutch countability data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Crosslingual Countability Classification with EuroWordNet

We examine the hypothesis that noun countability is consistent for a given word semantics by way of a series of experiments involving EuroWordNet and the English and Dutch languages. The basic method involves determining a default set of countabilities for each EuroWordNet synset based on countability-mapped words in that synset, and testing the match between these countabilities and those of h...

متن کامل

Crosslingual Countability Classification: English meets Dutch

This paper presents a range of methods for classifying Dutch nouns as countable, uncountable or plural only based on both Dutch and English data. The classification is based on the occurrence of countability specific linguistic features that are extracted from unannotated corpora. We show that in the absence of reliable Dutch gold standard data, cross-linguistic classification can be achieved o...

متن کامل

Mass counts in World Englishes: A corpus linguistic study of noun countability in non-native varieties of English

Research on the morpho-syntax of non-native varieties of English has reported a widespread presence of mass noun pluralization such as baggages, equipments and softwares. In this paper we conducted a corpus linguistic study in order to provide empirically substantiated answers to this claim. We examined the purported prevalence of noun countability in World Englishes in a 1.9 billion-token mega...

متن کامل

A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon

The present paper describes the current release of the Bochum English Countability Lexicon (BECL 2.1), a large empirical database consisting of lemmata from Open ANC (http://www.anc.org) with added senses from WordNet (Fellbaum, 1998). BECL 2.1 contains ≈ 11,800 annotated noun-sense pairs, divided into four major countability classes and 18 fine-grained subclasses. In the current version, BECL ...

متن کامل

Artifact Nouns: Reference and Countability

Formal semantic treatments of nominal reference have focused on natural kind nouns (Carlson 1980, Krifka et al. 1995), yet as has been widely recognized by anthropologists, cognitive scientists, philosophers, and others, artifacts (cup, furniture, chair) and natural kinds (dog, water, flower) differ critically in nature. We argue that these differences have important consequences for how artifa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003